A Probabilistic Multimedia Retrieval Model and Its Evaluation

نویسندگان

  • Thijs Westerveld
  • Arjen P. de Vries
  • Alex van Ballegooij
  • Franciska de Jong
  • Djoerd Hiemstra
چکیده

We present a probabilistic model for the retrieval of multimodal documents. The model is based on Bayesian decision theory and combines models for text-based search with models for visual search. The textual model is based on the language modelling approach to text retrieval, and the visual information is modelled as a mixture of Gaussian densities. Both models have proved successful on various standard retrieval tasks. We evaluate the multimodal model on the search task of TREC’s video track. We found that the disclosure of video material based on visual information only is still too difficult. Even with purely visual information needs, text-based retrieval still outperforms visual approaches. The probabilistic model is useful for text, visual, and multimedia retrieval. Unfortunately, simplifying assumptions that reduce its computational complexity degrade retrieval effectiveness. Regarding the question whether the model can effectively combine information from different modalities, we conclude that whenever both modalities yield reasonable scores, a combined run outperforms the individual runs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Probabilistic Models for Combining Diverse Knowledge Sources in Multimedia Retrieval

In recent years, the multimedia retrieval community is gradually shifting its emphasis from analyzing one media source at a time to exploring the opportunities of combining diverse knowledge sources from correlated media types and context. In order to combine multimedia knowledge sources, two basic issues must be addressed: what to combine and how to combine. While considerable effort has been ...

متن کامل

Information retrieval on mixed written and spoken documents

While advances have been made in structuring, indexing and retrieval of multimedia documents, we propose to study the unexplored problematics of information retrieval on heterogeneous media sets composed of written and spoken documents. The coverage of modalities in retrieved results seems to be an important part of the user’s information need. We show that this problematic is not satisfied by ...

متن کامل

A probabilistic framework for semantic video indexing, filtering, and retrieval

Semantic filtering and retrieval of multimedia content is crucial for efficient use of the multimedia data repositories. Video query by semantic keywords is one of the most difficult problems in multimedia data retrieval. The difficulty lies in the mapping between low-level video representation and high-level semantics. We therefore formulate the multimedia content access problem as a multimedi...

متن کامل

Towards a Scalable Networked Retrieval System for Searching Multimedia Databases

In this paper the architecture of a distributed and scalable multimedia information retrieval system (Dsmily) is described. The system consists of hierarchically organized networked nodes and is designed to integrate existing dynamic multimedia databases. The document ranking process as well as the preselection of databases to be searched, both tasks are based on a probabilistic model for distr...

متن کامل

Retrieving Information in Distributed Multimedia Databases

In this paper a new model and architecture for information retrieval in a widely distributed hetero-genous multimedia document collection is described. The model generalizes existing probabilistic models for non-distributed information retrieval. The architecture is a conceptual realization of this model. It is hierarchically built in order to provide extendability and scalability and designed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • EURASIP J. Adv. Sig. Proc.

دوره 2003  شماره 

صفحات  -

تاریخ انتشار 2003